Bidirectional Online Probabilistic Planning
نویسندگان
چکیده
We present Bidirectional Online Probabilistic Planner (BOPP)1, a novel planner that combines elements of Decision Theoretic Planning(DTP) and forward search. In particular, BOPP uses a combination of SPUDD and Upper Confidence Trees(UCT). We present our approach and some experimental results on the domains presented in the boolean fluents MDP track of the International Probabilistic Planning Competition(IPPC) 2011. Decision Theoretic Planning (DTP) (Boutilier, Dean, and Hanks 1999) is a well established method for solving probabilistic planning domains by casting them as Markov Decision Processes (MDP) and generating a policy. Classical solutions to DTP (Bellman 1957; Howard 1960) require the entire state space to be enumerated. This approach is usually infeasible for solving planning problems of interest. Recent advances in DTP have mitigated this effect by providing solution algorithms for factored (Boutilier, Dearden, and Goldszmidt 1999) and relational (Boutilier, Reiter, and Price 2001) MDPs. These solutions are abstract and require enumeration only of the relevant conditions that create a partition of the state space into equivalence classes (based on the policy or value), instead of enumerating the entire state space. One factored MDP solver in particular, SPUDD (Hoey et al. 1999), has been very successful at solving planning problems and has spawned numerous variants over the last decade. SPUDD employs Algebraic Decision Diagrams to represent and solve the underlying MDP using value iteration. However, experiments have shown that SPUDD proves to be inefficient for many of the planning problems presented in the recent IPPC. Forward Search has been another classic approach for AI planning. Brute force search is typically infeasible for large state spaces because the size of the search tree is exponential in the depth (length of the plan). Planners based on heuristic search, however, have shown success at the recent planning competitions (Bonet and Geffner 2001; Yoon, Fern, and Givan 2007; Teichteil-Koenigsbuch, Infantes, and Kuter 2008). The heuristic values of states or state-action pairs are typically derived automatically by solving a relaxation to the probabilistic planning problem. More recently, search algorithms for probabilistic planning based on simulation and
منابع مشابه
Probabilistic Backward and Forward Reasoning in Stochastic Relational Worlds
Inference in graphical models has emerged as a promising technique for planning. A recent approach to decision-theoretic planning in relational domains uses forward inference in dynamic Bayesian networks compiled from learned probabilistic relational rules. Inspired by work in non-relational domains with small state spaces, we derive a backpropagation method for such nets in relational domains ...
متن کاملA smoothing strategy for prm paths: application to 6-axes motoman manipulator
This paper describes the use of the probabilistic motion planning technique SBL “Single-Query Bidirectional Probabilistic Algorithm with Lazy Collision Checking” or in motion planning for robot manipulators. We present a novel strategy to remedy the PRM “Probabilistic Roadmap” paths which are both excessively long and velocity discontinuous. The optimization of the path will be done first throu...
متن کاملBidirectional Fast Marching Trees: An Optimal Sampling-Based Algorithm for Bidirectional Motion Planning
In this paper, we present the Bi-directional FMT∗(BFMT∗) algorithm for asymptotically-optimal, sampling-based path planning in cluttered, high-dimensional spaces. Specifically, BFMT∗ performs a twosource, lazy dynamic programming recursion over a set of randomlydrawn samples, correspondingly generating two search trees: one in costto-come space from the initial state and another in cost-to-go s...
متن کاملTask Space Regions: A framework for pose-constrained manipulation planning
We present a manipulation planning framework that allows robots to plan in the presence of constraints on end-effector pose, as well as other common constraints. The framework has three main components: constraint representation, constraintsatisfaction strategies, and a general planning algorithm. These components come together to create an efficient and probabilistically complete manipulation ...
متن کاملMultiple query probabilistic roadmap planning using single query planning primitives
We propose a combination of techniques that solve multiple queries for motion planning problems with single query planners. Our implementation uses a probabilistic roadmap method (PRM) with bidirectional rapidly exploring random trees (BI-RRT) as the local planner. With small modifications to the standard algorithms, we obtain a multiple query planner which is significantly faster and more reli...
متن کامل